16 research outputs found

    An End-to-End Multi-Task Learning Model for Image-based Table Recognition

    Full text link
    Image-based table recognition is a challenging task due to the diversity of table styles and the complexity of table structures. Most of the previous methods focus on a non-end-to-end approach which divides the problem into two separate sub-problems: table structure recognition; and cell-content recognition and then attempts to solve each sub-problem independently using two separate systems. In this paper, we propose an end-to-end multi-task learning model for image-based table recognition. The proposed model consists of one shared encoder, one shared decoder, and three separate decoders which are used for learning three sub-tasks of table recognition: table structure recognition, cell detection, and cell-content recognition. The whole system can be easily trained and inferred in an end-to-end approach. In the experiments, we evaluate the performance of the proposed model on two large-scale datasets: FinTabNet and PubTabNet. The experiment results show that the proposed model outperforms the state-of-the-art methods in all benchmark datasets.Comment: 10 pages, VISAPP2023. arXiv admin note: substantial text overlap with arXiv:2303.0764

    シンソウニューラルネットワークニヨルテガキテキストニンシキ

    Get PDF
    博士(工学)東京農工大

    TabIQA: Table Questions Answering on Business Document Images

    Full text link
    Table answering questions from business documents has many challenges that require understanding tabular structures, cross-document referencing, and additional numeric computations beyond simple search queries. This paper introduces a novel pipeline, named TabIQA, to answer questions about business document images. TabIQA combines state-of-the-art deep learning techniques 1) to extract table content and structural information from images and 2) to answer various questions related to numerical data, text-based information, and complex queries from structured tables. The evaluation results on VQAonBD 2023 dataset demonstrate the effectiveness of TabIQA in achieving promising performance in answering table-related questions. The TabIQA repository is available at https://github.com/phucty/itabqa.Comment: First two authors contributed equall

    Network Coding with Multimedia Transmission and Cognitive Networking: An Implementation based on Software-Defined Radio

    Get PDF
    Network coding (NC) is considered a breakthrough to improve throughput, robustness, and security of wireless networks. Although the theoretical aspects of NC have been extensively investigated, there have been only few experiments with pure NC schematics. This paper presents an implementation of NC under a two-way relay model and extends it to two\ua0non-straightforward scenarios: (i) multimedia transmission with layered coding and multiple-description coding, and (ii) cognitive radio with Vandermonde frequency division multiplexing (VFDM). The implementation is in real time and based on software-defined radio (SDR). The experimental results show that, by combining NC and source coding, we can control the quality of the received multimedia content in an on-demand manner. Whereas in the VFDM-based cognitive radio, the quality of the received content in the primary receiver is low (due to imperfect channel estimation) yet retrievable. Our implementation results serve as a proof for the practicability of network coding in relevant applications

    Network Coding with Multimedia Transmission and Cognitive Networking: An Implementation based on Software-Defined Radio

    Get PDF
    Network coding (NC) is considered a breakthrough to improve throughput, robustness, and security of wireless networks. Although the theoretical aspects of NC have been extensively investigated, there have been only few experiments with pure NC schematics. This paper presents an implementation of NC under a two-way relay model and extends it to two non-straightforward scenarios: (i) multimedia transmission with layered coding and multiple-description coding, and (ii) cognitive radio with Vandermonde frequency division multiplexing (VFDM). The implementation is in real time and based on software-defined radio (SDR). The experimental results show that, by combining NC and source coding, we can control the quality of the received multimedia content in an on-demand manner. Whereas in the VFDM-based cognitive radio, the quality of the received content in the primary receiver is low (due to imperfect channel estimation) yet retrievable. Our implementation results serve as a proof for the practicability of network coding in relevant applications

    Prospects for Food Fermentation in South-East Asia, Topics From the Tropical Fermentation and Biotechnology Network at the End of the AsiFood Erasmus+Project

    Get PDF
    Fermentation has been used for centuries to produce food in South-East Asia and some foods of this region are famous in the whole world. However, in the twenty first century, issues like food safety and quality must be addressed in a world changing from local business to globalization. In Western countries, the answer to these questions has been made through hygienisation, generalization of the use of starters, specialization of agriculture and use of long-distance transportation. This may have resulted in a loss in the taste and typicity of the products, in an extensive use of antibiotics and other chemicals and eventually, in a loss in the confidence of consumers to the products. The challenges awaiting fermentation in South-East Asia are thus to improve safety and quality in a sustainable system producing tasty and typical fermented products and valorising by-products. At the end of the “AsiFood Erasmus+ project” (www.asifood.org), the goal of this paper is to present and discuss these challenges as addressed by the Tropical Fermentation Network, a group of researchers from universities, research centers and companies in Asia and Europe. This paper presents current actions and prospects on hygienic, environmental, sensorial and nutritional qualities of traditional fermented food including screening of functional bacteria and starters, food safety strategies, research for new antimicrobial compounds, development of more sustainable fermentations and valorisation of by-products. A specificity of this network is also the multidisciplinary approach dealing with microbiology, food, chemical, sensorial, and genetic analyses, biotechnology, food supply chain, consumers and ethnology

    Recognition of Anomalously Deformed Kana Sequences in Japanese Historical Documents

    No full text

    013 Ⅱ論考編 研究成果の活用と関連研究 変体仮名の認識

    No full text
    2013年度~2017年度科学研究費補助金基盤研究(S) 研究成果報告書(課題番号25220401

    Doubly committed ventricular septal defect: Is it safe to perform surgical closure through the pulmonary trunk approached by right vertical axillary thoracotomy?Central MessagePerspective

    No full text
    Objective: This study investigated the safety of performing surgical repair for doubly committed ventricular septal defects by right vertical infra-axillary minithoracotomy (RVIAT). Methods: A retrospective comparative study was performed to evaluate the outcomes of patients who underwent doubly committed ventricular septal defects closure from January 2019 to May 2022. Seventy-four patients were enrolled in the study and treated with either the median sternotomy approach (MSA: n = 37) or the RVIAT approach (RVIAT: n = 37). Results: The median weight and age in the MSA group were significantly lower than those in the RVIAT group (MSA: 6.0 kg [interquartile range] (IQR), 5.2 to 8.7 kg] vs RVIAT: 7.5 kg [IQR, 5.6-14 kg]; P = .034 and MSA: 4.9 months [IQR, 3.6-9.4 month] vs 9.6 months [IQR, 5.0-60.4 months]; P = .0084). No patients died, and no patients in the RVIAT group required conversion to the MSA approach. The mean prebypass surgical time was longer in the RVIAT group (36.1 ± 8.2 minutes vs 31.8 ± 5.6 minutes; P = .03). There were no significant differences between the 2 groups in cardiopulmonary bypass time, aortic crossclamp time, or operation time. Significantly shorter ventilation times were observed in the RVIAT group (11.9 ± 8.2 hours vs 15.4 ± 6.3 hours; P = .006). Conclusions: Closure of doubly committed ventricular septal defects through the pulmonary trunk by the RVIAT approach is feasible and safe, and does not increase the risk of bypass-related complications
    corecore